RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/056,019B

DATE: 01/06/2003
TIME: 13:27:17




Input Set : A:\5853-2 Sequence Listing.txt
Output Set: N:\CRF4\01062003\I056019B.raw 


RECEIVED
JAN 15 2003
TECH CENTER 1600/2900


3 <110> APPLICANT: Tuomanen, Elaine I 

4 Wizemann, Theresa M. 

5 Masure, H. R. 

6 Johnson, Leslie S. 

9 <120> TITLE OF INVENTION: POLYPEPTIDE COMPRISING THE AMINO ACID OF AN N-TERMINAL

10 CHOLINE BINDING PROTEIN A TRUNCATE, VACCINE DERIVED 

11 THEREFROM AND USES THEREOF 

13 <130> FILE REFERENCE: 5853-2 

15 <140> CURRENT APPLICATION NUMBER: 09/056,019B

16 <141> CURRENT FILING DATE: 1998-04-07 
18 <160> NUMBER OF SEQ ID NOS: 40 
20 <170> SOFTWARE: PatentIn Ver. 2.0

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 406 

24 <212> TYPE: PRT 

25 <213> ORGANISM: Streptococcus pneumoniae 

27 <400> SEQUENCE: 1

28 Glu Asn Glu Gly Ala Thr Gln Val Pro Thr Ser Ser Asn Arg Ala Asn
29 1 5 10 15

31 Glu Ser Gln Ala Glu Gln Gly Glu Gln Pro Lys Lys Leu Asp Ser Glu
32 20 25 30

34 Arg Asp Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys Ile Val
35 35 40 45

37 Gly Glu Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr Ile Thr Val
38 50 55 60

40 Ala Leu Val Asn Glu Leu Asn Asn Ile Lys Asn Glu Tyr Leu Asn Lys
41 65 70 75 80

43 Ile Val Glu Ser Thr Ser Glu Ser Gln Leu Gln Ile Leu Met Met Glu
44 85 90 95

46 Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp Ser
47 100 105 110

49 Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser Asp
50 115 120 125

52 Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala Glu
53 130 135 140

55 Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys
56 145 150 155 160

58 Glu Glu Asp Arg Arg Asn Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Glu
59 165 170 175

61 Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu
62 180 185 190

64 Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gln Lys Ile Lys
65 195 200 205

67 Gln Ala Glu Ala Glu Val Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu
68 210 215 220

70 Lys Lys Ile Lys Thr Asp Arg Glu Glu Ala Glu Glu Glu Ala Lys Arg
71 225 230 235 240

73 Arg Ala Asp Ala Lys Glu Gln Gly Lys Pro Lys Gly Arg Ala Lys Arg
74 245 250 255

76 Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala
77 260 265 270

79 Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser
80 275 280 285

82 Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu
83 290 295 300

85 Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
86 305 310 315 320

88 Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp
89 325 330 335

91 Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
92 340 345 350

94 Glu Pro Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala Glu Val Glu
95 355 360 365

97 Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg
98 370 375 380

100 Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys
101 385 390 395 400

103 Val Lys Glu Lys Pro Ala
104 405

107 <210> SEQ ID NO: 2
108 <211> LENGTH: 655
109 <212> TYPE: PRT
110 <213> ORGANISM: Streptococcus pneumoniae
112 <400> SEQUENCE: 2


113 Glu Asn Glu Gly Ala Thr Gln Val Pro
114 1 5 10 15

116 Glu Ser Gln Ala Glu Gln Gly Glu Gln
117 20 25 30

119 Arg Asp Lys Ala Arg Lys Glu Val Glu
120 35 40 45

122 Gly Glu Ser Tyr Ala Lys Ser Thr Lys
123 50 55 60

125 Ala Leu Val Asn Glu Leu Asn Asn Ile
126 65 70 75 80

128 Ile Val Glu Ser Thr Ser Glu Ser Gln
129 85 90 95

131 Ser Arg Ser Lys Val Asp Glu Ala Val
132 100 105 110

134 Ser Ser Ser Ser Ser Ser Asp Ser Ser
135 115 120 125

137 Thr Ala Lys Pro Asn Lys Pro Thr Glu
138 130 135 140


140 Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys
141 145 150 155 160

143 Glu Glu Asp Arg Arg Asn Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Glu
144 165 170 175

146 Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu
147 180 185 190

149 Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gln Lys Ile Lys
150 195 200 205

152 Gln Ala Glu Ala Glu Val Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu
153 210 215 220

155 Lys Lys Ile Lys Thr Asp Arg Glu Glu Ala Glu Glu Glu Ala Lys Arg
156 225 230 235 240

158 Arg Ala Asp Ala Lys Glu Gln Gly Lys Pro Lys Gly Arg Ala Lys Arg
159 245 250 255

161 Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala
162 260 265 270

164 Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser
165 275 280 285

167 Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu
168 290 295 300

170 Ala Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
171 305 310 315 320

173 Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp
174 325 330 335

176 Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys
177 340 345 350

179 Glu Pro Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala Glu Val Glu
180 355 360 365

182 Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg
183 370 375 380

185 Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys
186 385 390 395 400

188 Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Pro Lys
189 405 410 415

191 Ala Glu Lys Pro Ala Pro Ala Pro Lys Pro Glu Asn Pro Ala Glu Gln
192 420 425 430

194 Pro Lys Ala Glu Lys Pro Ala Asp Gln Gln Ala Glu Glu Asp Tyr Ala
195 435 440 445

197 Arg Arg Ser Glu Glu Glu Tyr Asn Arg Leu Thr Gln Gln Gln Pro Pro
198 450 455 460

200 Lys Thr Glu Lys Pro Ala Gln Pro Ser Thr Pro Lys Thr Gly Trp Lys
201 465 470 475 480

203 Gln Glu Asn Gly Met Trp Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala
204 485 490 495

206 Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu Asn Ser Asn
207 500 505 510

209 Gly Ala Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr
210 515 520 525
212 Leu Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gln Asn Asn Gly
213 530 535 540

215 Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Leu
216 545 550 555 560

218 Gln Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ser Met Ala
219 565 570 575

221 Thr Gly Trp Leu Gln Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn
222 580 585 590

224 Gly Asp Met Ala Thr Gly Trp Val Lys Asp Gly Asp Thr Trp Tyr Tyr
225 595 600 605

227 Leu Glu Ala Ser Gly Ala Met Lys Ala Ser Gln Trp Phe Lys Val Ser
228 610 615 620

230 Asp Lys Trp Tyr Tyr Val Asn Gly Ser Gly Ala Leu Ala Val Asn Thr
231 625 630 635 640

233 Thr Val Asp Gly Tyr Gly Val Asn Ala Asn Gly Glu Trp Val Asn
234 645 650 655

237 <210> SEQ ID NO: 3
238 <211> LENGTH: 284
239 <212> TYPE: PRT
240 <213> ORGANISM: Streptococcus pneumoniae
242 <400> SEQUENCE: 3
243 Glu Asn Glu Gly Ala Thr Gln Val Pro Thr Ser Ser Asn Arg Ala Asn
244 1 5 10 15

246 Glu Ser Gln Ala Glu Gln Gly Glu Gln Pro Lys Lys Leu Asp Ser Glu
247 20 25 30

249 Arg Asp Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys Ile Val
250 35 40 45

252 Gly Glu Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr Ile Thr Val
253 50 55 60

255 Ala Leu Val Asn Glu Leu Asn Asn Ile Lys Asn Glu Tyr Leu Asn Lys
256 65 70 75 80

258 Ile Val Glu Ser Thr Ser Glu Ser Gln Leu Gln Ile Leu Met Met Glu
259 85 90 95

261 Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp Ser
262 100 105 110

264 Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser Asp
265 115 120 125

267 Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala Glu
268 130 135 140

270 Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys
271 145 150 155 160

273 Glu Glu Asp Arg Arg Asn Tyr Pro Thr Ile Thr Tyr Lys Thr Leu Glu
274 165 170 175

276 Leu Glu Ile Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu
277 180 185 190

279 Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gln Lys Ile Lys
280 195 200 205

282 Gln Ala Glu Ala Glu Val Glu Ser Lys Gln Ala Glu Ala Thr Arg Leu
283 210 215 220


285 Lys Lys Ile Lys Thr Asp Arg Glu Glu Ala Glu Glu Glu Ala Lys Arg
286 225 230 235 240

288 Arg Ala Asp Ala Lys Glu Gln Gly Lys Pro Lys Gly Arg Ala Lys Arg
289 245 250 255

291 Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala
292 260 265 270

294 Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu
295 275 280




298 <210> SEQ ID NO: 4 

299 <211> LENGTH: 106 

300 <212> TYPE: PRT 

301 <213> ORGANISM: Streptococcus pneumoniae 
303 <400> SEQUENCE: 4 


304 Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala
305 1 5 10 15

307 Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro
308 20 25 30

310 Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp Val
311 35 40 45

313 Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu
314 50 55 60

316 Pro Arg Asn Glu Glu Lys Val Lys Gln Ala Lys Ala Glu Val Glu Ser
317 65 70 75 80

319 Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys Ile Lys Thr Asp Arg Lys
320 85 90 95

322 Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala
323 100 105



326 <210> SEQ ID NO: 5 

327 <211> LENGTH: 109 

328 <212> TYPE: PRT 

329 <213> ORGANISM: Streptococcus pneumoniae 
331 <400> SEQUENCE: 5 


332 Thr Glu Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu
333 1 5 10 15

335 Ala Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr
336 20 25 30

338 Pro Thr Ile Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Ser Asp
339 35 40 45

341 Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Val Lys Ala Asn
342 50 55 60

344 Glu Pro Arg Asp Glu Gln Lys Ile Lys Gln Ala Glu Ala Glu Val Glu
345 65 70 75 80

347 Ser Lys Gln Ala Glu Ala Thr Arg Leu Lys Lys Ile Lys Thr Asp Arg
348 85 90 95

350 Glu Glu Ala Glu Glu Glu Ala Lys Arg Arg Ala Asp Ala
351 100 105




354 <210> SEQ ID NO: 6 

355 <211> LENGTH: 4 

356 <212> TYPE: PRT 
Please Note:

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence which presents at least one n or Xaa.

Seq#:6; Xaa Pos. 2,3 
Seq#:27; Xaa Pos. 1
Seq#:28; Xaa Pos. 243 
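
The check described in the note above can be sketched as follows. This is only a minimal illustration, assuming residues are written as three-letter codes separated by whitespace. The function name `find_xaa_positions` and the sample peptide are hypothetical; they are not taken from the checker software or from this listing.

```python
import re


def find_xaa_positions(residue_text):
    """Return the 1-based positions of Xaa in a three-letter residue string."""
    # Keep only three-letter residue tokens (e.g. "Ala", "Xaa");
    # rows of position numbers are ignored by the pattern.
    tokens = re.findall(r"\b[A-Z][a-z]{2}\b", residue_text)
    return [i for i, tok in enumerate(tokens, start=1) if tok == "Xaa"]


# Hypothetical four-residue peptide, used only for illustration.
example = "Ala Xaa Xaa Gly\n1"
print(find_xaa_positions(example))
```

Each reported position would then need a matching explanation in the <220> to <223> fields of that sequence.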
VERIFICATION SUMMARY DATE: 01/06/2003

PATENT APPLICATION: US/09/056,019B TIME: 13:27:18

Input Set : A:\5853-2 Sequence Listing.txt
Output Set: N:\CRF4\01062003\I056019B.raw

L:366 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:6 after pos.:0
L:1099 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:27 after pos.:0
L:1159 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:28 after pos.:240